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Abstract 

This paper describes in outline a method for translating Japanese temporal expressions 
into English. We argue that temporal expressions form a special subset of language that is 
best handled as a special module in machine translation. The paper deals with problems 
of lexical idiosyncrasy as well as the choice of articles and prepositions within temporal 
expressions. In addition temporal expressions are considered as parts of larger structures, 
and the question of whether to translate them as noun phrases or adverbials is addressed. 

1 Introduction 

In this paper we argue that the transfer and generation of temporal noun phrases and adverbials 
are best handled by a separate (though integrated) module in Japanese-to-English machine 
translation. 

There are several reasons for creating a separate module. They center on the fact that 
temporal expressions are lexically and syntactically highly idiosyncratic in both Japanese and 
English. In addition to grammatical differences, there are differences in style between the two 
language communities that need to be addressed by a machine translation system: for example 
in English stock market reports days are given as days of the week, whereas in Japanese they 
are given as dates. 

We argue that attempting to handle all phenomena by a single mechanism, while conceptu- 
ally elegant, causes unnecessary complications. In addition trying to shoehorn all idiosyncrasies 
into any one method is likely to strain it to the breaking point. Instead we adopt the multi-level 
machine translation approach of [Ikehara et al., 1991] in which there are many levels of transfer 
between two languages, and expressions are transferred at whatever level the system judges to 
be the most appropriate. 

The processing described has been implemented in the Japanese-to-English machine trans- 
lation system ALT-J/E [Ikehara et al., 1991, Ikehara et al., 1996]. 



2 Temporal Noun Phrases 

English temporal noun phrases, along with locatives, do not have all the properties of prototypi- 
cal 'purebred' noun phrases. They are one of the five types of noun phrases classed as 'defective' 
by [Ross, 1995]. For example, in contrast to 'purebred' noun phrases, they are typically pronom- 
inalized by then rather than it. There is also a great deal of variation in realization between 
dialects, with substantial differences between Australian, American and British English. 1 In 
addition, there is a lot of lexical idiosyncrasy as we will show in the following sections. 

1 There is also some dialectal variation in Japanese temporal expressions. For example, yanoasatte, which 
means the fourth day after today in the standard dialect, means three days after today in some areas. 
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In the following section we will restrict our discussion to noun phrases that show position in 
time, rather than duration or frequency. English temporal adjuncts are described in some detail 
in [Quirk et al., 1985, 526-555]. To the best of our knowledge, the only description of their 
treatment in natural language processing is that of [Flickingcr, 1996], who discusses English 
time expressions in the HPSG grammar used by the Verbmobil German/ Japanese-to-English 
machine translation project. 



2.1 Temporal Noun Phrase Structure 



English time position noun phrases used primarily to refer to time (for example within adverbials 
or the subject of sentences such as Spring has come) are highly idiosyncratic in their lexical 
choice, as well as their choice of determiners, typically having no surface determiner, although 
some take the definite article. Note that noun phrases headed by the same nouns, but not 
primarily referring to time, behave as do other nouns: It was a spring to remember. To handle 
these lexical and syntactic idiosyncrasies we introduce special processing for temporal noun 
phrases. 

[Ross, 1995, 433] claims that such defective noun phrases are always locally triggered, that 
is there is some clause mate that forces or enables the noun phrase to become defective. If we 
allow time position adjuncts to license themselves as being defective, then this claim holds, and 
we can always count on there being some trigger to introduce our special processing. 

We will now give some example of the idiosyncrasies, starting with the lack of an article for 
unmodified nouns, assuming they are in temporal noun phrases: (l). 2 

(1) a. today, yesterday, tomorrow (deictic-day) 

b. Monday (day-of -week) 

c. Christmas (holiday) 

d. 3 o'clock, 12:15 (numbered-hour) 

e. February (month) 

f. 1997 (year) 

g. dawn (time-of -day) 

h. winter (season) 



We analyze the noun phrases with no surface determiner as null determiners, phonologically 
empty determiners which appear in noun phrases with definite reference to a locatable, one- 
member referent set itself, following [Chestcrman, 1991, 73]. 3 Our analysis of null determiners 
is independently motivated, we also consider them to appear in proper names, technical terms, 
and noun phrases such as school in / went to school today. 

In contrast with the noun types shown in (1), ordinal numbers denoting the day of a month, 
on the other hand, normally take the definite article: the 19th. 

The choice of dependent also affects the choice of determiner, for example, there are some 
temporal expressions which take the null determiner when they are modified by another class 
of time expressions (2): 

(2) a. Monday morning 

2 After each example in (1) we give the semantic attribute of its head, from the hierarchy given in Figure 2. 
3 Null determiners are distinct from ZERO determiners, also phonologically empty, which appear with indefinite 
uncountable and plural noun phrases, the equivalent to a for singular countable noun phrases. 
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b. yesterday morning 

c. Monday night 

d. February 19 

e. February 19th 

We analyze these simply as "null modifier head" . In our translation module, we generate 
the determiner during the transfer from Japanese. [Flickinger, 1996] introduces a more elegant 
analysis where the first clement is treated as specifier (determiner) of the second, thus explaining 
the lack of article. Both the specifier's change in part of speech, and the head's choice of 
complement, are expressed by lexical rules. However, as null determiners or their equivalent are 
needed for the noun phrases in (1) anyway, we do not consider their use here to be problematic. 

In addition, as [Flickinger, 1996] notes, his analysis leads to noun phrases with multiple 
specifiers in expressions like February the 19th, although most analyses of English allow only 
one specifier. 

To handle such cases, we use a special structure, the special compound noun phrase, that we 
have established for noun phrases where there is no obvious head, such as person and company 
names, and addresses. We use this structure for time position noun phrases which include the 
following elements: year, day-of -month, month and/or numbered-time. In these noun phrases 
there is no obvious semantic head and there are many possible representations in English. 
We show some examples of the choice of expressions for a single date in (3). The choice of 
representation is mainly a question of style. In particular, it does depend on the Japanese 
source noun phrase, which has only two possible forms: 2-gatsu-19-nichi "2 month 19 day" and 
2-gatsu-no 19-nichi "2 month gen 19 day". 

(3) February the 19th vs February 19 vs February 19th vs the 19th of February 

By establishing a set of special structures for noun phrases that behave atypically, and thus 
have to be treated atypically anyway, we are able to preserve a uniform structure for all other 
noun phrases (with a single, although potentially phonologically empty, specifier). Although 
the grammar is consistent with 'purebred' noun phrases, the choice of determiner is not, which 
is why we argue for a separate module. 

In the next section we will describe the transfer module for temporal noun phrases, which 
does most of the hard work in creating appropriate English structures, handling lexical and 
phrasal idiosyncrasy. 

2.2 Transfer and Generation of Temporal Noun Phrases 

Temporal noun phrases in Japanese can be considered to be of three types; those headed by 
single nouns, those headed by compound nouns, and those made of one temporal noun phrase 
modifying another. 

The transfer stage for noun phrases headed by single nouns is basically a process of replacing 
them by their equivalents in the lexicon, which may be a single English noun: kind "yesterday" 
or a phrase: ototoi "the day before yesterday" . 

Dates (years, months, days of months, and numbered times) are compound nouns in Japanese, 
typically consisting of a number and a temporal noun. The compound noun rules first distin- 
guish between time positions and time periods, for example, the adverbial 13-nichi "13 days" 
could be on the thirteenth or for thirteen days. Once it has been determined that the noun 
phrase refers to time position, simple regular rules are used to generate the corresponding 
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English expressions. For example, days of months in Japanese have the form NUMERAL-nic/iz 
"NUMERAL day" and are translated into special compound noun phrases with the day-of -month 
slot filled by the value for the numeral. 4 

Complex noun phrases require more complicated rules. We show the rules for the combina- 
tion of a noun phrase denoting deictic-day or day with one denoting period-of-day (morn- 
ing, afternoon, evening) or night in Figure 1. The Japanese will be of the form a-no (3 "a-GEN 
fj" where a is headed by a day or deictic-day noun and /? is headed by a period-of-day 
noun or night, that translates into English noun phrase B. 

B is period-of-day (morning, afternoon, evening) or night: 
if A is deictic-day 
if A = issakusakujitsu "the day before the day before yesterday" 

=> the B before the B before last ; 3 before 
if A = ototoi "the day before yesterday" 

=> the B before last ; 2 before 
if A = kind "yesterday" 
if B = night 

=> NULL last night 
else 

=> null yesterday B ; 1 before 
if A — kyo "today" or honjitsu "today" 
if B = night 

=> NULL tonight 
else 

=> this B ; today's 
if A = ashita "tomorrow" 

=> NULL tomorrow B ; 1 after 
if A = asatte "the day after tomorrow" 

=> the B after next ; 2 after 
if A = shiasatte "the day after the day after tomorrow" 

=>■ the B after the B after next ; 3 after 
if A = yanoasatte "the day after the day after the day after tomorrow" 

=> the B after the B after the B after next ; 4 after [rare] 
if A is relative-day 
if A = zenjistu "the previous day" 

=> the previous B 
if A = yokujitsu "the following day" 

=> the following B 
if A is named-day 

=> NULL.4 B 

if A is day-of-month 

=> the B of the A (A must be ordinal) 

Figure 1: Rules for day, deictic-day and period-of-day 

4 This is further complicated by the fact that the Japanese have two counting systems for years, one based on 
the western system (A.D.) and the other on years of the current emperor's reign. 3-nen "3 years" is thus multiply 
ambiguous between, at least, 3 AD, 1903 AD, 2003 AD, 1991 AD (the third year of the current emperor's reign) 
and a three year period. Disambiguating these readings is outside of the scope of this paper. 
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These transfer rules capture the lexical and phrasal idiosyncrasies of the temporal noun 
phrases, and are relatively easy to test and expand. The rules are all specific to temporal noun 
phrases; no generality has been lost by putting them in a separate module. 

Note that these rules do not necessarily preserve the Japanese structure. In particular many 
temporal expressions made up of two noun phrases in Japanese are often most naturally trans- 
lated as a noun phrase and an adverbial phrase in English. For example, ashita-no akegata 
"tomorrow-GEN dawn" could possibly be translated as tomorrow 's dawn, but is generally trans- 
lated as dawn tomorrow where tomorrow is an adverbial modifying dawn. 

Attempts to preserve structure, translating all Japanese temporal noun phrases as English 
noun phrases, are problematic as the resulting expressions are often unwieldy or ambiguous. 
Consider for example kotoshi-no kurisumasu "this year-GEN Christmas" , which could potentially 
be translated as this year's Christmas, this Christmas or Christmas this year, this year's 
Christmas sounds extremely unnatural. The use of deictic dependents like this and last, are 
not bound by year boundaries; this Christmas spoken in January, could refer to Christmas last 
year, or the coming Christmas and is thus ambiguous in a way that the Japanese original is 
not. 5 In addition, there is considerable dialectal difference in the use of this and next. Thus the 
most acceptable translation of kotoshi-no kurisumasu "this year-GEN Christmas" is Christmas 
this year, where this year is an adverbial. 

We also found, in a preliminary investigation of bilingual corpora, that translation of 
Japanese temporal noun phrases into English adverbials was very common, not only in com- 
plex temporal noun phrases, but also in noun phrases with only one part being temporal, for 
example those in (4). 

(4) a. konshu-no uchiawase "this week-GEN meeting" 
this week's meeting vs the meeting this week 
b. getsuyobi-no uchiawase "Monday-GEN meeting" 
=> Monday's meeting vs the meeting on Monday. 

At present our rules translate most Japanese complex temporal noun phrases, and all tempo- 
ral noun phrases occurring inside larger non-temporal noun phrases as noun phrases in English, 
but as we identify more criteria for choosing between adverbials and noun phrases they will be 
added to our module. 

Note that we have similar sets of rules to those presented in Figure 1 for other time combi- 
nations, such as: the January before last (year + month), next Saturday (week + day) and so 
on. 

3 Temporal Adverbials 

Japanese temporal adverbials have the same structure as noun phrases, that is any number of 
modifiers, followed by a noun head, followed by a post-positional particle. The fact that the 
constituent is an adverbial is determined by the choice of postpositional particle, the semantic 
attribute of the head noun, and the structure of the sentence. As well as translating Japanese 
adverbials as adverbials it is often necessary to translate Japanese temporal phrases into English 
adverbials, as discussed in Section 2.2. In such cases, we need to select which preposition to 
use. In the next section we outline our algorithm for choosing prepositions, a sub-section of the 
module for translating temporal expressions. 

5 We are indebted to Tim Baldwin for reminding us of this. 
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Temporal-Noun 




Figure 2: Noun Type Hierarchy 



3.1 Choice of Preposition 

The main problem for generating English prepositions is deciding between on, at and in. Other 
prepositions, such as before or during have Japanese equivalents (normally functional nouns or 
post-positional particles) which can be used to select them. The two most widely used particles 
in Japanese for temporal (and locative) expressions are ni and, to a lesser extent, de, either of 
which can be translated as on, at or in. To choose between these three alternatives, we need 
to consider both the semantic attribute of the head noun and its dependents in English. We 
use the semantic attribute system for Japanese analysis of [Ikehara et al., 1997], augmented 
with several types defined especially in English generation. The relevant parts of the hierarchy 
are shown in Figure 2. The nodes underlined are those that appear in the Japanese semantic 
attribute system. 

Our type hierarchy matches that suggested by [Flickinger, 1996], and we adopt his nomen- 
clature. The hierarchy presented here is very similar, but includes deictic-day nouns and 
more branches under the day and non-day /hour node. The new nodes are needed in writing 
the rules to translate complex temporal noun phrases outlined in Section 2.2, and in choosing 
prepositions, as will be shown below. 

The hierarchy is loosely based on the choice of preposition for temporal adverbials formed by 
a prepositional phrase followed by an unmodified noun phrase headed by each type, deictic-day 
nouns take no preposition, they form adverbials by themselves. 

Noun phrases headed by day nouns will take the preposition on. The preposition is optional 
for noun phrases headed by day-of-week nouns in American English. Noun phrases headed 
by day-of -month nouns can take months as specifiers in Flickingcr's analysis, cardinal-day 
nouns can only take months, while ordinal-day nouns can take months or the or both. 

Noun phrases headed by hour nouns, including numbered-hour nouns and time-of-day 
nouns like noon and dawn are selected by the preposition at. The common denominator seems 
to be the idea of a precise moment in time. 

The remainder of noun phrases headed by temporal nouns are selected by the preposition 

in. 

Our algorithm for choosing whether to use a preposition, and if so which preposition to use, 
is given in Figure 3. 6 The algorithm is used to convert noun phrases headed by temporal nouns 
to adverbials, and has been extended to include noun phrases of the form A of B where A is 

6 In this algorithm, we treat the narrowest time unit as the head of temporal special compound noun phrases: 
for example, the heads of February the 19th and the 19th of February will both be 19th a cardinal-day noun. 
For other purposes, such as pluralization, the rightmost noun is treated as the head. 
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If equal to deictic-day (today, tomorrow, yesterday) or tonight 
or (premodified by one of the deictic terms this, that, last, next 
or one of the deictic-day nouns 
or a quantifier such as every, some) 
or post modified by ago, later 
=>■ adverbial is a noun phrase (no preposition) 
else create a prepositional phrase headed by: 

If head is (hour or night and not modified) or beginning, end: 
at 

Else-If head is day or (period-of -day and modified) 

=>■ on 
Else 
=> in 

Figure 3: Preposition choice for temporal adverbials 

beginning, middle, end and B is a temporal noun phrase. There are also some event nouns that 
could be analyzed as heading temporal noun phrases that can appear in temporal adverbials, 
such as opening in at the opening [of the Osaka Stock Exchange], we would like to extend our 
analysis to include them. 

on is optional for modified period-of -day nouns like Friday morning in at least one author's 
idiolect, and for day-of-week nouns in general in American English. In addition to dialect the 
choice depends on domain and genre (stock reports and speech tend to omit on). Depending 
on the users' requirements, some finer gradations may need to be made. By default we always 
generate on when it is considered optional, under the assumption that it is easier to delete it 
afterwards than put it in. 

3.2 Deictic Concerns 

There is another concern for a practical machine translation system, and that is the different 
choice of temporal expressions in different cultures. To take a well defined example, there is a 
marked difference between temporal expressions in Japanese and English on-line stock market 
reports in the Nikkei Biz Database. 

To give three examples: in the Japanese reports, days are given as days of the month while 
in English they are given as days of the week (5); expressions such as shumatsu "week end" 
are also translated as week days (6); and periods within the day are anchored to week day's in 
English but not in Japanese (7). 

(5) ... beikoku-jikan-no 12-13-nichi- ni hirakareru FOMC . . . 
American time-GEN 12- 13- day- on /at /in hold FOMC 

. . . the U.S. FOMC meeting Tuesday and Wednesday . . . 

(6) zen-shumatsu -no shikago nikkci-heikin-sakimono-daka 
last week end-GEN Chicago Nikkei average futures high 
Nikkei 225 futures gained in Chicago last Friday 

(7) daisho-shusei-wa macbike -ni kake yasuneken-de momi-au 
Osaka Stock Exchange-TOP morning close-DAT about low range at struggle 

Monday morning trading was confined to a boxed range at slightly lower levels 

For a machine translation system to be useful in such a domain it must be able to make 
these conversions. The conversion of dates to days is relatively simple and could theoretically 
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be done as part of a separate pre or post editing process. Changing stock market jargon to 
mornings and afternoons requires knowledge of the domain, while converting week beginnings 
and ends to days requires not just knowledge of the actual date, but also on which day the 
stock market was open, as, for example, the last trading day in the week is not necessarily a 
Friday (Friday may have been a holiday). 

We have added a special, domain-triggered section to our temporal expression transfer 
module that converts dates to days, and appropriately generates period of day expressions. It 
uses common date conversion routines, combined with the header date and time information 
available in the on-line market reports themselves. We are in the process of extending the 
module to handle market opening and closing date information. 

Extensions like these are a small step into translation based on domain understanding, which 
we hope will be the start of a long, but interesting journey. 

4 Conclusion 

In this paper we present an outline of rules for translating Japanese temporal phrases to English. 
Because the rules only apply to a specific limited class of input, and there are many lexical 
idiosyncrasies, we argue that these rules are better thought of as a separate module, although 
integrated with the entire system. We also emphasize the need to consider the context of 
temporal expressions, giving the example of Japanese and English Stock market reports, where 
knowledge of the date the report was prepared is essential to translate it. 

Acknowledgements 

The authors thank Tim Baldwin, Rodney Huddleston, Yukie Kuribayashi, Roland Sussex and 
the members of the NTT Machine Translation Research Group for their disscussion and com- 
ments; as well as Satoshi Mizuno and Shinsukc Okuyama of NTT Software for their help in the 
implementation. 

References 

[Chesterman, 1991] Andrew Chesterman. 1991. On definiteness: A study with special reference to 
English and Finnish. Cambridge studies in linguistics; 56. Cambridge University Press, Cambridge. 

[Flickinger, 1996] Dan Flickinger. 1996. English time expressions in an HPSG grammar. In Studies on 
the Universality of Constraint- Based Phrase Structure Grammars, Technical Report 06044133, pages 
1-8. Osaka University, Japan. 

[Ikehara et al., 1991] Satoru Ikehara, Satoshi Shirai, Akio Yokoo, and Hiromi Nakaiwa. 1991. Toward 
an MT system without pre-editing - effects of new methods in ALT-J/E-. In Third Machine 
Translation Summit: MT Summit III, pages 101-106, Washington DC. (cmp-lg/9510008). 

[Ikehara et al., 1996] Satoru Ikehara, Satoshi Shirai, and Francis Bond. 1996. Approaches to dis- 
ambiguation in ALT-J/E. In International Seminar on Multimodal Interactive Disambiguation: 
MIDDIM-96, pages 107-117, Grenoble. 

[Ikehara et al., 1997] Satoru Ikehara, Masahiro Miyazaki, Satoshi Shirai, Akio Yokoo, Hiromi Nakaiwa, 
Kentaro Ogura, Yoshifumi Ooyama, and Yoshihiko Hayashi. 1997. The Semantic System, volume 1 
of A Japanese Lexicon. Iwanami Shoten, Tokyo, (in press). 

[Quirk et al., 1985] Randolph Quirk, Sidney Greenbaum, Geoffrey Leech, and Jan Svartvik. 1985. A 
Comprehensive Grammar of the English Language. Longman, Essex. 

[Ross, 1995] John Robert Ross. 1995. Defective noun phrases. In Papers from the Regional Meeting 
of the Chicago Linguistic Society, volume 31, pages 398-440. University of Chicago. 



8 



